Raised relativeAccuracy to 0.2 since 0.1 causes ~16% random failures at n=10,000 due to expected stddev fluctuations.#9588
Conversation
…failures at `n=10,000` due to expected `stddev` fluctuations.
relativeAccuracy to 0.2 since 0.1 causes `~16% random failures at n=10,000 due to expected stddev fluctuations.relativeAccuracy to 0.2 since 0.1 causes ~16% random failures at n=10,000 due to expected stddev fluctuations.
|
🎯 Code Coverage 🔗 Commit SHA: acd784f | Docs | Was this helpful? Give us feedback! |
BenchmarksStartupParameters
See matching parameters
SummaryFound 0 performance improvements and 0 performance regressions! Performance is the same for 51 metrics, 8 unstable metrics. Startup time reports for petclinicgantt
title petclinic - global startup overhead: candidate=1.54.0-SNAPSHOT~acd784fb1b, baseline=1.54.0-SNAPSHOT~7b1d89d384
dateFormat X
axisFormat %s
section tracing
Agent [baseline] (1.017 s) : 0, 1017408
Total [baseline] (10.769 s) : 0, 10768769
Agent [candidate] (1.024 s) : 0, 1023972
Total [candidate] (10.775 s) : 0, 10774979
section appsec
Agent [baseline] (1.194 s) : 0, 1194197
Total [baseline] (11.012 s) : 0, 11011837
Agent [candidate] (1.194 s) : 0, 1194402
Total [candidate] (10.938 s) : 0, 10937943
section iast
Agent [baseline] (1.154 s) : 0, 1154148
Total [baseline] (11.028 s) : 0, 11028120
Agent [candidate] (1.152 s) : 0, 1151990
Total [candidate] (11.117 s) : 0, 11117347
section profiling
Agent [baseline] (1.164 s) : 0, 1164463
Total [baseline] (11.038 s) : 0, 11037824
Agent [candidate] (1.162 s) : 0, 1161904
Total [candidate] (11.053 s) : 0, 11053002
gantt
title petclinic - break down per module: candidate=1.54.0-SNAPSHOT~acd784fb1b, baseline=1.54.0-SNAPSHOT~7b1d89d384
dateFormat X
axisFormat %s
section tracing
crashtracking [baseline] (1.453 ms) : 0, 1453
crashtracking [candidate] (1.454 ms) : 0, 1454
BytebuddyAgent [baseline] (686.254 ms) : 0, 686254
BytebuddyAgent [candidate] (690.914 ms) : 0, 690914
GlobalTracer [baseline] (257.415 ms) : 0, 257415
GlobalTracer [candidate] (258.957 ms) : 0, 258957
AppSec [baseline] (31.567 ms) : 0, 31567
AppSec [candidate] (31.845 ms) : 0, 31845
Debugger [baseline] (6.37 ms) : 0, 6370
Debugger [candidate] (6.386 ms) : 0, 6386
Remote Config [baseline] (687.066 µs) : 0, 687
Remote Config [candidate] (681.662 µs) : 0, 682
Telemetry [baseline] (12.652 ms) : 0, 12652
Telemetry [candidate] (12.619 ms) : 0, 12619
section appsec
crashtracking [baseline] (1.455 ms) : 0, 1455
crashtracking [candidate] (1.45 ms) : 0, 1450
BytebuddyAgent [baseline] (709.473 ms) : 0, 709473
BytebuddyAgent [candidate] (709.345 ms) : 0, 709345
GlobalTracer [baseline] (249.156 ms) : 0, 249156
GlobalTracer [candidate] (249.754 ms) : 0, 249754
AppSec [baseline] (171.207 ms) : 0, 171207
AppSec [candidate] (171.143 ms) : 0, 171143
Debugger [baseline] (6.065 ms) : 0, 6065
Debugger [candidate] (6.027 ms) : 0, 6027
Remote Config [baseline] (624.564 µs) : 0, 625
Remote Config [candidate] (610.9 µs) : 0, 611
Telemetry [baseline] (9.949 ms) : 0, 9949
Telemetry [candidate] (9.88 ms) : 0, 9880
IAST [baseline] (25.093 ms) : 0, 25093
IAST [candidate] (25.075 ms) : 0, 25075
section iast
crashtracking [baseline] (1.455 ms) : 0, 1455
crashtracking [candidate] (1.456 ms) : 0, 1456
BytebuddyAgent [baseline] (808.711 ms) : 0, 808711
BytebuddyAgent [candidate] (807.473 ms) : 0, 807473
GlobalTracer [baseline] (248.412 ms) : 0, 248412
GlobalTracer [candidate] (248.099 ms) : 0, 248099
AppSec [baseline] (27.553 ms) : 0, 27553
AppSec [candidate] (27.341 ms) : 0, 27341
Debugger [baseline] (6.241 ms) : 0, 6241
Debugger [candidate] (6.188 ms) : 0, 6188
Remote Config [baseline] (607.521 µs) : 0, 608
Remote Config [candidate] (587.276 µs) : 0, 587
Telemetry [baseline] (8.373 ms) : 0, 8373
Telemetry [candidate] (8.138 ms) : 0, 8138
IAST [baseline] (31.77 ms) : 0, 31770
IAST [candidate] (31.729 ms) : 0, 31729
section profiling
ProfilingAgent [baseline] (101.95 ms) : 0, 101950
ProfilingAgent [candidate] (101.567 ms) : 0, 101567
crashtracking [baseline] (1.437 ms) : 0, 1437
crashtracking [candidate] (1.442 ms) : 0, 1442
BytebuddyAgent [baseline] (719.179 ms) : 0, 719179
BytebuddyAgent [candidate] (717.716 ms) : 0, 717716
GlobalTracer [baseline] (235.604 ms) : 0, 235604
GlobalTracer [candidate] (235.038 ms) : 0, 235038
AppSec [baseline] (31.173 ms) : 0, 31173
AppSec [candidate] (31.137 ms) : 0, 31137
Debugger [baseline] (6.511 ms) : 0, 6511
Debugger [candidate] (6.464 ms) : 0, 6464
Remote Config [baseline] (721.403 µs) : 0, 721
Remote Config [candidate] (734.657 µs) : 0, 735
Telemetry [baseline] (16.746 ms) : 0, 16746
Telemetry [candidate] (16.684 ms) : 0, 16684
Profiling [baseline] (102.547 ms) : 0, 102547
Profiling [candidate] (102.157 ms) : 0, 102157
Startup time reports for insecure-bankgantt
title insecure-bank - global startup overhead: candidate=1.54.0-SNAPSHOT~acd784fb1b, baseline=1.54.0-SNAPSHOT~7b1d89d384
dateFormat X
axisFormat %s
section tracing
Agent [baseline] (1.018 s) : 0, 1017855
Total [baseline] (8.679 s) : 0, 8678794
Agent [candidate] (1.018 s) : 0, 1017570
Total [candidate] (8.664 s) : 0, 8664239
section iast
Agent [baseline] (1.161 s) : 0, 1160771
Total [baseline] (9.292 s) : 0, 9292422
Agent [candidate] (1.15 s) : 0, 1150036
Total [candidate] (9.383 s) : 0, 9383035
gantt
title insecure-bank - break down per module: candidate=1.54.0-SNAPSHOT~acd784fb1b, baseline=1.54.0-SNAPSHOT~7b1d89d384
dateFormat X
axisFormat %s
section tracing
crashtracking [baseline] (1.46 ms) : 0, 1460
crashtracking [candidate] (1.438 ms) : 0, 1438
BytebuddyAgent [baseline] (685.389 ms) : 0, 685389
BytebuddyAgent [candidate] (685.82 ms) : 0, 685820
GlobalTracer [baseline] (257.218 ms) : 0, 257218
GlobalTracer [candidate] (257.472 ms) : 0, 257472
AppSec [baseline] (31.62 ms) : 0, 31620
AppSec [candidate] (31.557 ms) : 0, 31557
Debugger [baseline] (6.349 ms) : 0, 6349
Debugger [candidate] (6.312 ms) : 0, 6312
Remote Config [baseline] (686.209 µs) : 0, 686
Remote Config [candidate] (675.704 µs) : 0, 676
Telemetry [baseline] (14.219 ms) : 0, 14219
Telemetry [candidate] (13.369 ms) : 0, 13369
section iast
crashtracking [baseline] (1.482 ms) : 0, 1482
crashtracking [candidate] (1.453 ms) : 0, 1453
BytebuddyAgent [baseline] (814.013 ms) : 0, 814013
BytebuddyAgent [candidate] (806.626 ms) : 0, 806626
GlobalTracer [baseline] (249.404 ms) : 0, 249404
GlobalTracer [candidate] (246.892 ms) : 0, 246892
IAST [baseline] (32.152 ms) : 0, 32152
IAST [candidate] (30.839 ms) : 0, 30839
AppSec [baseline] (27.375 ms) : 0, 27375
AppSec [candidate] (28.262 ms) : 0, 28262
Debugger [baseline] (6.275 ms) : 0, 6275
Debugger [candidate] (6.165 ms) : 0, 6165
Remote Config [baseline] (607.354 µs) : 0, 607
Remote Config [candidate] (602.902 µs) : 0, 603
Telemetry [baseline] (8.398 ms) : 0, 8398
Telemetry [candidate] (8.296 ms) : 0, 8296
LoadParameters
See matching parameters
SummaryFound 3 performance improvements and 1 performance regressions! Performance is the same for 8 metrics, 12 unstable metrics.
Request duration reports for insecure-bankgantt
title insecure-bank - request duration [CI 0.99] : candidate=1.54.0-SNAPSHOT~acd784fb1b, baseline=1.54.0-SNAPSHOT~7b1d89d384
dateFormat X
axisFormat %s
section baseline
no_agent (4.368 ms) : 4314, 4422
. : milestone, 4368,
iast (9.885 ms) : 9719, 10052
. : milestone, 9885,
iast_FULL (14.202 ms) : 13916, 14487
. : milestone, 14202,
iast_GLOBAL (10.878 ms) : 10684, 11072
. : milestone, 10878,
profiling (8.639 ms) : 8509, 8769
. : milestone, 8639,
tracing (7.91 ms) : 7792, 8028
. : milestone, 7910,
section candidate
no_agent (4.326 ms) : 4279, 4373
. : milestone, 4326,
iast (9.397 ms) : 9243, 9551
. : milestone, 9397,
iast_FULL (14.465 ms) : 14172, 14758
. : milestone, 14465,
iast_GLOBAL (10.354 ms) : 10171, 10537
. : milestone, 10354,
profiling (9.038 ms) : 8879, 9196
. : milestone, 9038,
tracing (7.399 ms) : 7297, 7501
. : milestone, 7399,
Request duration reports for petclinicgantt
title petclinic - request duration [CI 0.99] : candidate=1.54.0-SNAPSHOT~acd784fb1b, baseline=1.54.0-SNAPSHOT~7b1d89d384
dateFormat X
axisFormat %s
section baseline
no_agent (36.681 ms) : 36394, 36968
. : milestone, 36681,
appsec (48.546 ms) : 48115, 48976
. : milestone, 48546,
code_origins (43.095 ms) : 42717, 43473
. : milestone, 43095,
iast (45.259 ms) : 44860, 45659
. : milestone, 45259,
profiling (48.371 ms) : 47931, 48812
. : milestone, 48371,
tracing (42.948 ms) : 42579, 43317
. : milestone, 42948,
section candidate
no_agent (36.249 ms) : 35960, 36539
. : milestone, 36249,
appsec (48.147 ms) : 47724, 48571
. : milestone, 48147,
code_origins (43.784 ms) : 43407, 44161
. : milestone, 43784,
iast (44.89 ms) : 44487, 45293
. : milestone, 44890,
profiling (48.233 ms) : 47805, 48662
. : milestone, 48233,
tracing (43.933 ms) : 43559, 44307
. : milestone, 43933,
DacapoParameters
See matching parameters
SummaryFound 0 performance improvements and 0 performance regressions! Performance is the same for 11 metrics, 1 unstable metrics. Execution time for tomcatgantt
title tomcat - execution time [CI 0.99] : candidate=1.54.0-SNAPSHOT~acd784fb1b, baseline=1.54.0-SNAPSHOT~7b1d89d384
dateFormat X
axisFormat %s
section baseline
no_agent (1.471 ms) : 1460, 1483
. : milestone, 1471,
appsec (3.713 ms) : 3497, 3930
. : milestone, 3713,
iast (2.193 ms) : 2131, 2256
. : milestone, 2193,
iast_GLOBAL (2.243 ms) : 2180, 2306
. : milestone, 2243,
profiling (2.039 ms) : 1989, 2090
. : milestone, 2039,
tracing (2.026 ms) : 1976, 2076
. : milestone, 2026,
section candidate
no_agent (1.469 ms) : 1457, 1480
. : milestone, 1469,
appsec (3.621 ms) : 3408, 3835
. : milestone, 3621,
iast (2.202 ms) : 2139, 2264
. : milestone, 2202,
iast_GLOBAL (2.239 ms) : 2176, 2302
. : milestone, 2239,
profiling (2.033 ms) : 1982, 2083
. : milestone, 2033,
tracing (2.02 ms) : 1970, 2069
. : milestone, 2020,
Execution time for biojavagantt
title biojava - execution time [CI 0.99] : candidate=1.54.0-SNAPSHOT~acd784fb1b, baseline=1.54.0-SNAPSHOT~7b1d89d384
dateFormat X
axisFormat %s
section baseline
no_agent (15.068 s) : 15068000, 15068000
. : milestone, 15068000,
appsec (15.037 s) : 15037000, 15037000
. : milestone, 15037000,
iast (18.56 s) : 18560000, 18560000
. : milestone, 18560000,
iast_GLOBAL (18.135 s) : 18135000, 18135000
. : milestone, 18135000,
profiling (15.633 s) : 15633000, 15633000
. : milestone, 15633000,
tracing (15.071 s) : 15071000, 15071000
. : milestone, 15071000,
section candidate
no_agent (15.228 s) : 15228000, 15228000
. : milestone, 15228000,
appsec (15.051 s) : 15051000, 15051000
. : milestone, 15051000,
iast (18.362 s) : 18362000, 18362000
. : milestone, 18362000,
iast_GLOBAL (18.001 s) : 18001000, 18001000
. : milestone, 18001000,
profiling (15.352 s) : 15352000, 15352000
. : milestone, 15352000,
tracing (15.117 s) : 15117000, 15117000
. : milestone, 15117000,
|
bric3
left a comment
There was a problem hiding this comment.
Couple of questions:
- Alternatively would it be better to retry ?
- I haven't looked deeply into the test, but I noticed increasing
relativeAccuracynumbers, is there a pattern to be careful to follow ?
Otherwise, that looks OK to me.
The odd part is that the second test uses a larger sample size |
|
Yeah that seemed odd to me for the same reasons you stated, so let's go ahead with this change :) |
What Does This Do
Raised
relativeAccuracyto0.2since0.1causes~16%random failures atn=10,000due to expectedstddevfluctuations.Motivation
Green CI.
Additional Notes
Fixed flaky test that failing with 16% probability.